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Site-Specific Recombination in Eukarvotes 
and Constructs Useful Therefor 



FIELD OF THE INVENTION 



10 



The present invention relates to methods for 
manipulating chromosomal sequences in cells by site- 
specific recombination promoted by recombinases . In a 
particular aspect, the present invention relates to methods 
for producing embryonic stem cells bearing nucleic acid 
sequences that have been rearranged by a site -specif ic 
I euuiiibiiiciye expressed from a construct controlled by a 
tissue-specific promoter (e.g., a germline specific 
promoter) . In another aspect, the present invention 
relates to methods for producing embryonic stem cells 
bearing nucleic acid sequences that have been rearranged by 
a si te - specif ic recombinase expressed from a construct 
controlled by a conditional promoter. 
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BACKGROUND OF THE INVENTION 
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The analysis of gene function has increasingly come to 
require the production of subtle, tissue-specific, and 
conditional mutations in animals and plants. Although 
there are a number of methods for engineering subtle 
mut£itions in embryonic stem (ES) cells (Hasty et al. (1991) 
Nature 350:243- 246, Askew et ai . (1993) Mol Cell Biol 
13:4115-4124), the use of si te - specif ic recombinases to 
remove the selectable marker that permits isolation of 
homologously recombined ES cell clones has become 
increasingly prevalent (Kitamoto ct ai . (1996) Biochew 
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Brain Sciences, eds . Nakanishi et al . (Japan Scientific 
Press) , pp. 89-98) . 



Site-specific recombinases represent the best method 
for creating tissue- specif ic and conditional mutations in 
5 animals and plants, being employed first to remove the 
selectable marker to create a functionally wild-type 
allele, and then to inactivate the allele mosaically in 
animals and plants by removing some essential component in 
a tissue-specific or conditional manner (Gu et al , (1994) 

10 Science 26b : 103 - 106 ; Kuhn et al . (1995) Science 
269:1427-1429). Current protocols for using excissive 
site-specific recombination to remove selectable markers 
include transiently transfecting ES cell clones with a 
recombinase expression vector (Gu et al . (1993) Cell 

15 73:1155-1164) , microin j ect ing fertilized oocytes containing 
the recombinant allele with a recombinase expression vector 
(Kitamoto et al . (1996) Biochern Biophys Res Comwun 
222:742-747; Araki et al . (1995) Proc Natl Acad Sci USA 
92:160-164), or breeding animals and plants containing the 

20 recombinant allele to animals and plants, respectively, 
containing a recombinase transgene (Schwenk et al . (1995) 
Nucleic Acids Res 23:5080-5081; Lewandoski et al . (1997) 
Curr Biol 7:148-151) . Each of these approaches requires an 
investment of some combination of time, resources, and 

25 expertise over that required to generate animals and plants 
with homologously recombined alleles. The most commonly 
employed method, the secondary transfection of homologously 
recombined ES cell clones with a recombinase expression 
vector, additionally requires extended culture time that 
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recombinase nucleic acid constructs that were expressed in 
the germline, but not to an appreciable extent in the ES 
cells themselves or somatic tissues of animals and plants. 
The lack of ES cell expression would mean that targeting 
5 vectors containing selectable markers flanked by 
recombinase target sites could be used to isolate 
homologous recombinants without fear that the marker would 
be excised during culture. Robust recombinase expression 
in gametes would mean that the marker would be excised in 

10 at least some of the progeny of ES cell chimeras. Only a 
single step would be required to isolate subtle mutations 
and, if two different recombinase systems were employed, 
conditional and tissue-specific alleles could be produced 
with similar improvements in efficiency. A 

15 germline- specif ic recombinase nucleic acid construct could 
also be used to deliver recombined target nucleic acid 
constructs to the early embryo (Lewandoski et al . (1997) 
Curz Biol 7:148-151), so long as the recombined target was 
not detrimental to the terminal stages of spermatogenesis. 

20 Previous reports have shown that expression of nucleic 

acid constructs containing the proximal promoter of the 
mouse protamine 1 (mPl) locus is restricted to haploid 
spermatids in mature mice (Peschon et al . (1987) Proc Natl 
Acad Sci U S A 84:5316-5319; Behringer et al . (1988) Proc 

25 Natl Acad Sci USA 85:2648-2652), although low levels of 
ectopic expression may occur in some mature tissues 
(Behringer et al. (1988) Proc Natl Acad Sci USA 
85:2648-2652). Inclusion of the mPl promoter does not 
guarantee expression in the male germline, however, for 
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levels in spermatids (Behringer et al . (1988) Proc Natl 
Acad Sci USA 85:2648-2652). 

Accordingly, there is a need in the art for methods to 
modulate expression of recombined target nucleic acid 
5 sequences in the early embryo. In addition, there is a 
need in the art for tissue-specific and conditional 
recombinatory tools to create transgenic animals and 
plants. These and other needs in the art are addressed by 
the present invention. 

10 BRIEF DESCRIPTION OF THE INVENTION 

The present invention meets the need in the art for 
modulating expression of recombined target nucleic acid 
sequences to the early embryo. The present invention 
further meets the need in the art for tissue-specific and 

15 conditional recombinatory tools to create transgenic 
animals and plants. Thus, in accordance with the present 
invention, it has been discovered that nucleic acid 
constructs encoding a germline specific promoter 
operatively associated with a recombinase coding sequence 

20 lead to efficient recombination of a target nucleic acid 
construct in the male germline, but not in other tissues. 
This suggests that such nucleic acid constructs could be 
used for the efficient production of embryos bearing 
conditional, genetically lethal alleles. It has 

25 additionally been discovered that ES cell lines generated 
from one of these transgenic lines could be used in 
combination with targeting vectors that contained 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 illustrates a schematic of P2Bc and P2Br 
alleles. The positions of the PGR primers used (5'P and 
3'P) are indicated on the diagrams of the P2Bc and P2Br 
5 alleles. 

Figure 2 depicts the targeting of the hoxb-1 locus in 
ProCre ES cells using a targeting vector that contains a 
loxP" flanked selectable marker. Top, schematic of the 
wild- type hoxb-1 locus showing the positions of the two 

10 exons (open boxes), the position of a 5 ' Nrul site and 
flanking BamHI restriction endonuclease sites, and PGR 
primers (triangles) that amplify a 204 bp product from the 
wild-type allele that includes the Nrul site. Middle, the 
predicted organization of homologously recombined hoxb-1 

15 allele in which a neomycin cassette (NEO) , flanked by loxP 
sites (L) , has been inserted into the Nrul site shown in 
the top diagram. The insertion creates a novel BamHI site 
and the same PGR primers now amplify a 1600 bp product. 
Bottom: the predicted structure of the recombined allele 

20 shown in the middle panel after Cre-mediated excision of 
the neomycin cassette to leave a single loxP site in place 
of: the Nrul site of the wild-tyT:)e allele. Amplification 
with the same primers now yields a 268 bp product. 

DETAILED DESCRIPTION OF THE INVENTION 

25 In accordance with the present invention, there are 

provided nucleic acid constructs comprising a germline- 
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As used herein, the term "promoter" refers to a 
specific nucleotide sequence recognized by RNA polymerase, 
the enzyme that initiates RNA synthesis. The promoter 
sequence is the site at which transcription can be 
5 specifically initiated under proper conditions. The 
recombinase nucleic acid(s), operatively linked to the 
suitable promoter, is (are) introduced into the cells of a 
suitable host, wherein expression of the recombinase 
nucleic acid(s) is (are) controlled by the promoter. 

10 Germline-specif ic promoters contemplated for use in 

the practice of the present invention include the protamine 
1 gene promoter, the protamine 2 gene promoter, the 
spermatid-specif ic promoter from the c-kit gene (Albanesi 
et al . (1996) De\^elopment 122 (4) : 1291-1302) , the sperm- 

15 specific promoter from angiotensin-converting enzyme 
(Howard et al . (1993) Mol Cell Biol 13(l):18-27; Zhou et 
al . (1995) Dev Genet 16 (2) : 201-209) , oocyte specific 
promoter from the ZPl gene, oocyte specific promoter from 
the ZP2 gene, oocyte specific promoter from the ZP3 gene 

20 (Schickler et al . (1992) Mol Cell Biol 12 (1) : 120 - 127 ) , and 
the like. 

In addition to the above- desci'ibed germline - specif 1 c 
promoters, tissue-specific promoters specific to plants eire 
also contemplated for use in the practice of the present 
25 invention, including, for example, the LAT52 gene promoter 
from tomato, the LAT56 gene promoter from tomato, the LAT5 9 
gene promoter from tomato Eyal et al . (1995) Plant Cell 
7 (3) :373-384) , the pol len- speci f ic promoter of the Brassica 
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Recombinases contemplated for use in the practice of 
the present invention include Cre recombinase , FLP 
recombinase , the R gene product of Zygosaccharomyces 
(Onouchi et al . (1995) Mol Gen Genet 247 (6) : 653 -660 ) , and 
5 the like. 

Presently preferred constructs contemplated for use in 
the practice of the present invention include ProCre 
(comprising the protamine 1 gene promoter operatively 
associated with Cre recombinase) , ProFLP (comprising the 
10 protamine l gene promoter operatively associated with FLP 
recombinase) , ProR (comprising the protamine 1 gene 
promoter operatively associated with the R gene product of 
Zygosaccharomyces), and the like. 

In accordance with another embodiment of the present 
15 invention, there are provided nucleic acid constructs 
comprising a conditional promoter or a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 

Promoters contemplated for control of expression of 
20 recombinase nucleic acid(s) employed in accordance with 
this aspect of the present invention include inducible 
(e.g., minimal CMV promoter, minimal TK promoter, modified 
MMLV LTR) , constitutive (e.g., chicken 3-actin promoter, 
MMLV LTR (non-modified) , DHFR) , and/or tissue specific 
2 5 promoters. 

Conditional promoters contemplated for use in the 
cj: suitabi'i ^naucible promoter :j :nc:l\uit;> DMA sequences 
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corresponding to: the E. coli lac operator responsive to 
IPTG (see Nakamura et al . , Cell, 18:1109-1117, 1979); the 
metal lot hi one in promoter metal - regulatory- elements 
responsive to heavy-metal (e.g., zinc) induction (see Evans 
5 et al., U.S. Patent No. 4,870,009), the phage T7lac 
promoter responsive to IPTG (see Studier et al . , Meth. 
Enzymol., 185: 60-89, 1990; and U.S. #4,952,496), the heat- 
shock promoter; the TK minim.al promoter; the CM\^ minimal 
promoter; a synthetic promoter; and the like. 

10 Exemplary constitutive promoters contemplated for use 

in the practice of the present invention include the CMV 
promoter, the SV4 0 promoter, the DHFR promoter, the mouse 
mammary tumor virus (MMTV) steroid- inducible promoter, 
Moloney murine leukemia virus (MMLV) promoter, elongation 

15 factor la (EFla) promoter, albumin promoter, APO Al 
promoter, cyclic AMP dependent kinase II (CaMKII) promoter, 
keratin promoter, CD3 promoter, immunoglobulin light or 
heavy chain promoters, neurof iliment promoter, neuron 
specific enolase promoter, L7 promoter, CD2 promoter, 

20 myosin light chain kinase promoter, HOX gene promoter, 
thymidine kinase (TK) promoter, RNA Pol II promoter, MYOD 
promoter, MYF5 promoter, phophoglycerokinase (PGK) 
promoter, Stfl promoter, Low Density Lipoprotein (LDL) 
promoter, chicken B -act in promoter (used in conjunction 

25 with ecdysone response element) and the like. 

As readily understood by those of skill in the art, 
the term "tissue specific" refers to the substantially 
exclusive initiation of transcription in the tissue from 
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the practice of the present invention include the GH 
promoter, the NSE promoter, the GFAP promoter, 
neurotransmitter promoters (e.g., tyrosine hydroxylase, TH, 
choline acetyltransf erase , ChAT, and the like) , promoters 
5 for neurotropic factors (e.g., a nerve growth factor 
promoter, NT-3, BDNF promoters, and the like), and so on. 

In accordance with yet another embodiment of the 
present invention, there are provided embryonic stem cells 
containing a nucleic acid construct as described herein. 

10 As readily understood by those of skill in the art, 

the above -described constructs can be introduced into a 
variety of animal species, such as, for example, mouse, 
rat, rabbits, swine, ruminants (sheep, goats and cattle), 
humans, poultry, fish, and the like. Transgenic 

15 amphibians, insects, nematodes, and the like, are also 
contemplated. Members of the plant kingdom, such as, for 
example, transgenic mono- and dicotyledonous species, 
including important crop plants, i.e., wheat, rice, maize, 
soybean, potato, cotton, alfalfa, and the like, are also 

2 0 contemplated. 

For example, pluripotent iai ES cells can be derived 
from early pre- implantation embryos, preferably the ov^a ar-e 
harvested between the eight-cell and blastocyst stages. ES 
25 cells are maintained in culture long enough to permit 
integration of the promoter- recombinase nucleic acid 
construct ( s) . The cells are then either injected into a 
host blastocyst, i.e., the blastocoel of the host 
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transgenic offspring are termed "chimeric, " as some of 
their cells are derived from the host blastocyst and some 
transfected ES cells. The host embryos are transferred 
into intermediate hosts or surrogate females for continuous 
5 development. 

The transformation procedure for plants usually relies 
on the transfer of a transgene carrying a particular 
promoter construct via the soil bacterium -A^rroJbac ter ium 
tumef aciens . Transformation vectors for this procedure are 

10 derived from the T-DNA of A. tumefaciens , and transgenes 
are stably incorporated into the nuclear genome. The 
activity of the transgenes can then be monitored in the 
regenerated plants under different conditions. In this 
way, many promoter elements that are involved in complex 

15 regulatory pathways such as light responsiveness or tissue 
specificity have been defined. 

Alternatively, direct (i.e., vectorless) gene 
transfer systems are also contemplated including chemical 
methods, electroporation, microinjection, biolistics, and 

20 the like. Protoplasts isolated from the plants can be 
obtained by treatment with cell wall degrading enzymes. 
DNA can be introduced into plant protoplasts by a number of 
physical techniques including electroporation and 
polyethylene glycol treatment in the presence of MgCl;, . 

25 The method of choice for rapid promoter analyses in plants 
is the biolistic method. This technique involves the 
delivery of the particular DNA construct into plant cells 
by micropro j ect iles , i.e., nucleic acid(s) coated or 
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of transformation if appropriate selectable markers are 
included . 

In a preferred embodiment, the genome of embryonic 
stem cells according to the invention comprise a 
5 transcriptionally active selectable marker flanked by two 
recombination target sites. It is especially preferred 
that the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline- specif ic 
promoter is selective for the recombination target sites 
10 flanking said selectable marker. 

Optionally, embryonic stem cells according to the 
invention may further comprise one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
15 different than the recombination target sites which flank 
said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence , 

20 a second nucleic acid construct comprising a tissue- 

specific promoter operatively associated with a second 
recombinase coding sequence, or the like. Preferably, the 
second recombinase coding sequence will be different than 
the first recombinase coding sequence. 

25 The ability to select and maintain nucleic acid 

constructs in the host cell is an important aspect of an 
expression system. The most common type of selectable 



puromycin, blastophyc in , and tho. like. Other approaches 
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employ specially constructed host cells which require the 
selectable marker for survival. Such selectable markers 
include the valine tRNA synthetase, val S, the 
single-stranded DNA binding protein, ssh, thymidine kinase, 
5 or the like. Alternatively, naturally occurring partition 
systems that maintain copy number and select against 
plasmid loss is also contemplated. An example is the 
incorporation of the parB locus. Other selectable markers 
include HPRT and the like. 

10 Selectable markers specific for plants include, the 

gus A (uid A) , the Jbar gene, phosphinothricin and the like. 

In accordance with still another embodiment of the 
present invention, there are provided methods for excission 
of the transcriptionally active selectable marker from the 
15 above -described embryonic stem cells, said method 
comprising : 

passaging the genome derived from said embryonic stem 
ceils through gametogenesis (i.e., spermatogenesis or 
oogenesis) . 

20 Excission of marker as contemplated herein can cause 

a variety of end results, e.g., deletion of the marker or 
a nucleic acid sequence, gain of function or loss of 
function, replacement of function, and the like, as well as 
modulation of any one or more of these results. 

25 Functions which are contemplated to be manipulated 

include regulating body size and growth rate, including 
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resistance to viral and bacterial diseases (i.e., 
"constitutive immunity" or germ-line transmission of 
specific, recombined antibody genes) , more efficient wool 
production, and the like. Other functions which are 
5 contemplated to be modulated include development of lines 
of transgenic animals and plants for use in directing 
expression of transgenes encoding biologically active human 
proteins . 

Agronomic traits which are contemplated to be 
modulated by use of the present invention include tolerance 
to biotic an abiotic stress, increased resistance to 
herbicides, pest damage, and viral, bacterial, and fungal 
diseases, improvement of crop quality {i.e., increase in 
nutritional value of food and feed) , reduction of post- 
harvest losses, improvement of suitability and enlargement 
of the spectrum for processing (i.e., altered quantity and 
composition of endogenous properties, production of new 
compounds of plant or non-plant origin such as biopolymers 
or pharmaceutical substances) . 

20 In accordance with a still further embodiment of the 

present invention, there are provided methods for the 
production of recombinant alleles, said method comprising: 
introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
25 cells as described herein, and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 
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random insertion, retroviral insertion, site specific- 
mediated recombination, and the like. 

Nucleic acid fragments contemplated for use herein 
include fragments containing an essential portion of a gene 
5 of interest. 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
production of recombinant alleles, .s^id method comprising: 

introducing at least one recombinase responsive construct 
10 into embryonic stem cells as described herein, 

wherein said construct (s) comprise (s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 
15 wherein said nucleic acid fragment is flanked by 

a second pair of recombination target sites, 

passaging the genome derived from said embryonic stem cells 
through gametogenesis . 

In a presently preferred aspect:, the first pair of 
20 recombination target sites is recognized by a recombinase 
which is expressed under the control of a germline - specif ic 
promoter and said second pair of recombination target sites 
is recognized by a recombinase which is expressed under the 
control of a conditional promoter or a tissue specific 
2 5 orom.oter. 

■ i\. :a.:.Lr:ei :v.;tiu i; i i :--r;c.)i::: ic:::.: ^- 'oiist. !:"uct 

selected from constructs comprising a conditional promoter 
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operatively associated with a recombinase coding sequence, 
a construct comprising a tissue-specific promoter 
operatively associated with a recombinase coding sequence, 
and the like. 

5 In accordance with still another embodiment of the 

present invention, there are provided methods for the 
conditional assembly of functional gene(s) for expression 
in eukaryotic cells by recombination of individual inactive 
gene segments from one or more gene{s) of interest, 
10 wherein each of said segments contains at least one 

recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 

said method comprising: 

15 introducing said individual inactive gene 

segments into an embryonic stem cell as described 
herein, thereby providing a DNA which encodes a 
functional gene of interest, the expression product of 
which is biologically active, upon passage of the 

20 genome derived from said stem cells through 

gametogenesis. 

For assembly of functional genes from inactive gene 
segments, see, for example, US Patent No. b, 654, 182, 
incorporated herein by reference in its entirety. 

25 In accordance with a still further embodiment of the 

present invention, there are provided methods for the 
generation of recombinant livestock, said method 
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pluripotential ES cells derived from early pre-implantation 
embryos , and 

introducing these combined embryos into a host female 
and allowing the derived embryos to come to term. 

5 

In accordance with yet another embodiment of the 
present invention, there are provided methods for the 
generation of recombinant plants, said method comprising 
transforming plant zygotes with nucleic acid constructs 
10 according to the invention and allowing the zygote to 
develop . 

The objective of the current work with ProCre nucleic 
acid constructs was to determine the potential of 
germline-specif ic promoters to implement efficient 

15 approaches utilizing site-specific recombinases to generate 
an array of sophisticated mutations in mammals and plants. 
The data shows that it is possible to create recombinase 
nucleic acid constructs that are expressed at high levels 
in the germ line but not to a functionally significant 

20 extent in either ES cells or embryonic or adult somatic 
tissues. Homologous recombinants with a selectable marker 
can be isolated in ES cells that contain 
promoter -recombinase nucleic acid constructs. Transgenic 
animals and plants bearing the promoter-recombinase nucleic 

25 acid constructs and a target allele transmit the recombined 
target to their progeny at high frequencies. These results 
establish the principle that mammals and plants containing 
loci that have been homologously recombined and then 
subsequently site-specifically recombined can be generated 
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progeny of ES cell chimeras without any investment of time, 
expertise, or resources over that required to create an 
allele that still contains a selectable marker. The 
paradigm has obvious utility in the production of subtle 
5 and conditional mutations that require generation of 
alleles with minimal structural alterations. Because the 
presence and transcriptional activity of selectable markers 
can contribute to phenotypes in an unanticipated and 
unwanted manner (Fiering et al . (1995) Genes Dev 
10 9:2203-2213); Olson et al . (1996) Cell 85:1-4), the 
approach will also useful for generating null alleles. 

Expression of the endogenous mPl locus (Hecht et al . 
(1986) Exp Cell Res 164:183-190), and mPl-driven nucleic 
acid constructs (Behringer et al . (1988) Proc Natl Acad Sci 

15 [7 5 A 85 :2648-2652 ; Braun et al . (1989) Na ture 337 : 373 - 376 ; 
Zambrowicz et al , (1993) Proc Natl Acad Sci USA 
90:5071-5075) is restricted to haploid spermatids. 
Expression of mPl nucleic acid construct expression 
typically begins at haploid stages, and both RNA (Caldwell 

20 and Handel (1991) Proc Natl Acad Sci USA 88:2407-2411) 
and proteins (Braun et al . (1989) Nature 337:373-376) 
diffuse through the spermatogenic syncytium. The result is 
a highly efficient recombination of target alleles and the 
segregation of recombinase and target nucleic acid 

25 constructs in the first generation. 

Cre-mediated recombination proved to be highly testis- 
specific in ProCre mice. It is clear that the nucleic acid 
constructs are not expressed in the inner cell mass or in 



(1989) Development 106:37-46; Soriano and Jaenisch (1986) 
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Cell 46:19-29). If ProCre nucleic acid constructs 

recombined target sequences during pre- implantation stages, 
at least a few percent of the cells in many tissues would 
contain the P2Br allele and Southern and PGR analyses 
5 showed that this was not the case. The ectopic Cre 
activity seen in some ProCre strains probably resulted from 
low levels of recombinase expression in later embryos or 
mature tissues, a finding consistent with the expression 
patterns of other mPl-driven nucleic acid constructs. 

10 Northern analyses have failed to reveal the expression of 
mPl -containing nucleic acid constructs in a variety of 
mature tissues (Peschon et al . (1987) Proc Natl Acad Sci 
USA 84:5316-5319; Behringer et al . (1988) Proc Natl Acad 
Sci USA 85:2648-2652; Peschon et al . (1989) Ann N Y Acad 

15 Sci 564:186-197; Zambrowicz et al . (1993) Proc Natl Acad 
Sci USA 90:5071-5075), but nucleic acid constructs 
containing the mPl promoter and the SV40 T-antigen led to 
the consistent development of tumors of the petrosal bone 
and right cardiac atrium (Behringer et al . (1988) Proc Natl 

20 Acad Sci USA 85:2648-2652) . 

PCR assays represent a very sensitive assay for 
whether sufficient levels of Cre protein were produced to 
effect recombination. Importantly, they measured the 
cumulative level of recombination, for events that occurred 
25 at any stage of development are likely to have been 
propagated to, and might be amplified in, descendant 
populations. The highest level of ectopic recombination 
was that observed in cardiac ventricular tissue of 
strain which generated a signal approximately equivalent to 
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strains showed evidence of recombination in the cardiac 
atria and the petrosal bone was not examined. These assays 
did not rule out the possibility that higher levels of 
recombination occur in tissues that were not examined or 
5 that the low levels of recombination observed in some 
tissues reflected high levels of recombination in some 
component cell population. 

These low levels of ectopic activity suggest that 
mpl -driven recombinase nucleic acid constructs could be 

10 used for the production of embryos containing genetically 
lethal alleles. Some alleles created by homologous 
recombination in ES cells will prove to be lethal in 
heterozygotes , as was the case for an mRNA editing mutation 
of the GluR2 glutamate receptor subunit (Brusa et al . 

15 (1995) Science 270:1677-1680) . Germline transmission would 
be restricted to rare chimeras in which the level of 
chimerism was low enough in tissues affected by the 
mutation to allow survival and high enough in the germline 
to allow transmission. This problem could be circumvented 

20 by creating recombinase -conditional mutations in ES cells 
bearing mpl -recombinase nucleic acid constructs, or by 
making the same mutations in standard ES cells and then 
introducing the mpl -recombinase nucleic acid construct by 
breeding. So long as the recombined version of the allele 

2 5 did not adversely impact terminal stages of 
spermatogenesis, embryos containing the recombined allele 
could be efficiently produced. Embryos containing 

recombined nucleic acid constructs can also be produced 
through the activity of Cre nucleic acid constructs that 



Dromotcr (Lakso et al. (1992) Prcc Natl Acad Sci U S A 

A- 
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89:6232-6236), or the zP3 promoter (Lewandoski et al . 
(1997) Curr Biol 7:148-151). ProCre and zP3 nucleic acid 
constructs have the advantage of delivering a recombined 
allele to the zygote, guaranteeing that all cells in the 
5 derived embryos will contain the allele. 

ProCre ES cells are but one of many different kinds of 
recotnbinase-bearing ES cells that could significantly 
shorten the time and effort required for a wide variety of 
genetic manipulations in mice. The most obvious of these 

10 are complementary ProFLP ES cells in which the FLP 
recombinase was derived from S. cerevisae (Broach and Hicks 
(1980) Cell 21:501-508) or another species (Kuhn et al . 
(1995) Science 269:1427-1429). Conceptually distinct from 
these but perhaps as generically useful would be ES cells 

15 bearing inducible recombinase nucleic acid constructs that 
would facilitate temporal control of recombinase expression 
in ES cells, chimeras, and their progeny to generate 
site-specifically recombined alleles (Araki et al . (1992) 
J Mol Biol 225:25-37; No et al. (1996) Proc Natl Acad Sci 

20 USA 93:3346-3351; Logie and Stewart (1995) Proc Natl Acad 
Sci USA 92:5940-5944; Fell et al . (1996) Proc Natl Acad 
Sci USA 93:10887-10890) . Finally, fusion genes that led 
to recombinase expression in specific tissues could be used 
to address specific research objectives. 

25 The invention will now be described in greater detail 

by reference to the following non- limiting examples. 

Example 1 



30 Peschon er. al . (1989) Annals of the New York Academy of 
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Sciences 186-197) was isolated by PGR using PGR primers 
(SEQ ID NOs : 2 and 3) and genomic DNA templates from GGE ES 
cells (Robertson et al . (1986) Nature 323:445-448). This 
fragment was fused to a modified Cre coding sequence (SEQ 
5 ID NO: 4) which contains a consensus translation start site 
(Kozak (1986) Ceil 44:283-292), 11 codons for a human c-myc 
epitope (Evan et al . (1985) Mol Cell Biol 5:3610-3616), 
7 codons for a minimal SV40 nuclear localization signal 
(Kalderon et al . (1984) Cell 39:499-509) and the 

10 polyadenylation signal from pIC-Cre in the plasmid pOG304M 
(SEQ ID NO:5). The Gre expression plasmid pOG231 was 
prepared by fusing a modified Cre coding sequence from 
pIC-Cre (Gu et al . (1993) Celi 73:1155-1164), and 
containing the same translation start and nuclear 

15 localization signal, to the synthetic intron and Civrv 
promoter of pOG44 (0' Gorman et al . (1991) Science 
251:1351-1355) . 

A plasmid, pOG277 (SEQ ID NO : 7 ) , containing a 
loxP- flanked neomycin cassette was prepared by inserting a 

20 wild-type loxP site (SEQ ID NO: 8; Hoess et al . (1982) Proc 
Natl Acad Sci USA 79:3398-402) into pBSKS (Stratagene) 
and then cloning the neomycin expression cassette from 
pMClneo-polyA (Thomas et al . (1987) Cell 51:503-512) 
between interactions of this loxP site. The hoxb-1 

25 targeting construct consisted of the PGK-TK cassette from 
pPNT (Tybulewicz et al . (1991) Cell 65:1153-63), and 1.4kb 
and 10.2kb of sequences 5' and 3' to an Nru I site 800 bp 
5' to the hoxb-1 transcriptional start site isolated from 
a 129 strain genomic library (Stratagene) , The 
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J. Biol. Chem. 262:10695-10705) to create the P2Bc allele 
(Figure 1) . Cre-mediated recombination of the P2Bc allele 
results in the deletion of the neomycin cassette (Neo) of 
P2Bc that is flanked by two loxP sites, leaving a single 
5 loxP site and fusing the B-Gal coding sequence to the 
initial codons of the RNA polymerase II coding sequence. 
Recombination increases the size of a Pst I fragment 
recognized by the RP2 probe, which is external to the 
targeting vector used, indicated by the shaded box below 
10 each allele. 

Example 2 
Production of transgenic mice 

Fertilized oocytes obtained from matings of 129/SvJae 
(Simpson et al . (1997) Nat Genet 16:19-27) and BALB/c X 

15 C57BL/6 Fl mice were used for pronuclear injections of the 
Protamine-Cre fusion gene from pOG304M according to 
standard protocols (Hogan et al . Manipulating the Mouse 
Embryo: The Manual, Coldspring Harbor Press (1994), pg , 
497). Production of ES cells and homologous recombinants: 

20 Heterozygous ProCre 129/SvJae males were mated to 
12 9/SvEms-4-'^*-^VJ females (Simpson et al . (1997) Wat Genet 
16:19-27) to produce blastocysts that were cultured 
according to standard protocols (Robertson (1987) 
Teratocarcinomas and enabryonic stem cells, a practical 

25 approach, eds . E. J. Robertson (IRL Press), pp. 71-112). 
The sex (King et al . (1994) Genomics 24:159-68) and ProCre 
status of each line were determined by PGR assays. 



v:re activity used 100 ng o: genomic DNA as a compla:..*.' to 



amplify a P2Br- specif ic product using a 5' primer from the 
RP2 promoter and a 3' primer from the P-GAL coding sequence 
(Figure 1) . Thirty cycles of amplification were done in a 
total volume of 100 fxl using 300 ng of each primer, 3 mM 
MgCl2, 1.5 units of Taq polymerase, and an annealing 
temperature of 60 °C. Southern blots of reaction products 
were hybridized with a probe specific for the P2Br reaction 
product . 

Example 3 

ProCre Nucleic Acid Constructs Efficiently Recombine 

Target Alleles 

A total of nine founder animals with ProCre nucleic 
acid constructs were obtained from injections of a 
Protamine-Cre fusion gene. Two lines were derived from 
injections of 129SvJae (Simpson et al. (1997) iVat Genet 
16:19-27) embryos, and seven from injections of CB6F2 
embryos. The 129/SvJae lines and three randomly selected 
hybrid lines were examined in detail. To determine whether 
ProCre nucleic acid constructs would efficiently recombine 
a target allele, males were generated that contained a 
ProCre nucleic acid construct and a target for Cre-mediated 
recombination. This "P2Bc" (Pol ii-GAL, conditional) 

target (Figure 1) was created using homologous 
recombination in ES cells to insert a loxP- flanked neomycin 
cassette and a [5-GAL coding sequence into the first exon of 
the locus coding for the large subunit of RNA 
polymerase II. Cre-mediated recombination of the loxP 
sites was expected to delete the intercalated sequences, 
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determine if they inherited the P2Bc or the P2Br allele, 
and to additionally determine the segregation pattern of 
ProCre nucleic acid constructs and P2Br alleles. Southern 
blot of Pst I digested tail biopsy DNA's from a +/P2Bc, 
5 -f/ProCre male (sire) and four of his progeny by a wild-type 
female probed with n RP2 probe (top) and then reprobed with 
a Cre probe (bottom) . The large majority of transmitted 
target alleles were Cre-recombined P2Br alleles (Table 1) . 
ProCre nucleic acid constructs and recombined target 

10 alleles segregated independently in the first generation; 
approximately 50% of mice that inherited a P2Br allele also 
inherited their male parent's ProCre nucleic acid 
construct. All RP2 mutant alleles in the progeny were 
P2Br, and some progeny inherit a P2Br allele without 

15 inheriting ProCre nucleic acid construct. Mouse 4 did not 
contain a ProCre nucleic acid construct and is homozygous 
wild-type at the RP2 locus. These data establish that 
ProCre nucleic acid constructs efficiently recombine the 
P2Bc allele in the male germline and that the recombined 

20 P2Br alleles and ProCre nucleic acid constructs segregate 
in the first generation. Because significantly more than 
25% of the progeny inherited recombined target alleles, 
recombination either occurred during diploid stages of 
spermatogenesis or Cre generated during haploid stages of 

2b spermatogenesis was distributed among spermatids through 
cytoplasmic bridges (Braun et al . (1989) Nature 
337:373-376), effecting recombination in spermatids that 
did not themselves contain a ProCre nucleic acid construct. 



The progeny of matings between ProCre males and -f/P2Bc 
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progeny examined by Southern blotting, none contained a 
Cre-recombined P2Br allele. 



It has also been discovered that a loxP- flanked neo 
cassette in the glutamate receptor R6 subunit locus is 
5 efficiently recombined by ProCre nucleic acid constructs in 
mice . 



Example 4 

ProCre Nucleic acid construct Expression is Highly 

Tissue- Specific 

10 Genomic DNAs from ten different tissues of five- to 

seven-week old males that contained both a ProCre nucleic 
acid construct and a P2Bc target allele were analyzed in 
Southern blots. Southern blots were prepared of Pst I 
digested DNA from testes (T) and one other tissue (K, 

15 kidney; B, brain; S, spleen) of males heterozygous for one 
of four ProCre nucleic acid constructs and the P2Bc allele. 
Testis DNA from each male shows a P2Br allele signal, in 
addition to those generated by the wild-type RP2 (WT) and 
P2Bc alleles. Other tissues show only the WT and P2Bc 

20 signals. Only the testis samples showed signal indicating 
Cre-mediated recombination of the target. The intensity of 
the P2Br signal relative to that of the wild-type allele 
ranged from 10% to 22% for different ProCre strains and did 
not correlate with the ProCre nucleic acid construct copy 

25 number. The copy number of ProCre nucleic acid constructs 
varied among lines showing similar levels of recombination 
in testis. For example, restriction patterns and 
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similar to results obtained with other mPl promoter-driven 
nucleic acid constructs {Peschon et al . (1987) Proc Natl 
Acad Sci USA 84:5316-5319; Zambrowicz et al . (1993) Proc 
Natl Acad Sci USA 90:5071-5075). 

5 As a more sensitive measure of ectopic recombination, 

PGR amplifications were performed on the same samples. The 
amplification primers were expected to produce a 325 bp 
product from the recombined target and a 1.4 kb fragment 
from the unrecombined allele (Figure 1) . The assay was 

10 expected to measure the cumulative level of recombination, 
for any P2Br alleles formed during transient expression of 
Cre during development would be preserved and perhaps 
amplified in descendant cells. Low levels of ectopic 
recombination product were observed in some tissues of all 

15 ProCre lines except for one. A southern blot of PGR 
amplification products of the P2Br allele utilized tissues 
from a male heterozygous for the ProGre nucleic acid 
construct and the P2Bc allele. DNA from 10 different 
tissues was amplified using primers and conditions that 

20 produced a 350 bp product from the recombined, P2Br allele. 
Each lane contains 10% of the reactions, except for the 
testis reactions, which were diluted 500 (T5) , 250 (T2), 
and 100 (Tl) fold prior to loading, and a liver 
reconstruction control (C) , which was diluted 1:100 before 

25 loading. The highest level of ectopic activity was 

observed in cardiac ventricular muscle of mice; in these 
samples the ectopic signal was more than 100 fold lower 
than that observed in testis. Three strains showed much 
lower levels of recombination in brain tissue, and 
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Example 5 

Isolation of HomQlogouslv Recombined ProCre ES Cell 
Clones Using TarQeting Vectors with a loxP-Flanked 

Selectable Marker 

5 Four male +/ProCre ES cell lines were established from 

129/Sv strain ProCre transgenic mice. In preliminary 
experiments, passage 5 cells from one of these lines {PC3) 
were used to generate three male chimeras with between 50 
and 95% coat color chimerism. In matings with C57BL/6 

10 females, two of these male chimeras have sired a total of 
11 pups, all bearing the Agouti coat color signifying 
germline transmission of the ES cell genome, and 6 of 9 
pups genotyped additionally contained the line 70 ProCre 
nucleic acid construct. The frequency of germline 

15 transmission has not yet been determined, nor has it been 
determined whether competency for germline transmission 
will persist in homologously recombined ProCre ES cells at 
later passages. 

To determine if homologously recombined ProCre ES cell 
20 clones could be isolated using targeting vectors that 
contained a loxP- flanked selectable marker, two 
transf actions were done using variants of a targeting 
vector in which a loxP- flanked neomycin ceissettc was 
inserted into an Nru I site in the hoxb 1 locus promoter 
2S (Figure 2) . A Southern blot of BamHI -digested genomic DNAs 
were harvested from a 96-well plate from 10 doubly- selected 
ES cell clones and hybridized with a probe (shown in Figure 
2) which is external to the targeting construct. All 
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PCs-derived clones that were ganciclovir and G418-resistant 
{Mansour et al. (1988) Nature 336:348-352) were found to be 
homologously recombined. In two parallel transf ections of 
CCE cells (Robertson et al . (1986) Nature 323:) with the 
5 same vectors, 32 of 93 (34%) and 15 of 132 (11%) clones 
were homologously recombined. The total numbers of 
G418-resistant clones recovered from ProCre ES cell 
transf ections were reduced relative to the parallel CCE 
transf ections . This may be attributable to both 

10 Cre-mediated excision of the neomycin cassette and to the 
fact that the transf ections were done under electroporation 
conditions optimized for CCE cells. 

Because it was formally possible that the homologously 
recombined clones contained inactive loxP sites, five 

15 homologously recombined PC3 ES cell clones and the parental 
PC3 cell line using the primers shown in Figure 2 were 
either mock transfected or transiently transfected with the 
pOG231 Cre expression vector. For the transient 

transfection assay, DNA was harvested 48 hours after 

20 transfection and used in PCR assays to assess whether the 
loxP sites in the recombinant clones could be recombined by 
Cre. In all cases a clear recombination signal was 
observed in the pOG231 transfected sample. The recombinant 
clones and parental cell lines show the 204 bp 

25 amplification product of the wild-type allele, and the 
recombinant clones additionally show a 1600 bp product 
(1600) resulting from amplification across the neomycin 
cassette and a nonspecific 1100 bp amplification product 
(NS) . The pOG231- transf ected recombinant clones show an 
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ProCre ES cells. Five recombinant clones were grown in the 
presence of G418 for two weeks, and then aliquots of each 
were grown either in the presence or absence of G418 for a 
further 10 days. PGR assays were performed to determine if 
5 Cre-recombined alleles were present in any of these samples 
and none was observed in the mock transfected controls. 
These data suggest that there is not enough Cre activity to 
significantly influence either the ability to isolate 
recombinant clones or the stability of the selectable 
10 markers in those clones, establishing that the loxP sites 
in these clones were functional. 



To determine if there was any detectable Cre activity 
in ProCre ES cells, aliquots of two lines {PC3 and PCS) 
were transiently transfected with the targeting vector used 

15 to create the P2Bc allele. DNA was recovered 4 8 hours 
after transfection and used for PCR amplifications of the 
P2Br plasmid molecules that would be generated by 
extrachromosomal Cre-mediated recombination. Small amounts 
of recombination product were seen in both ProCre ES cell 

20 transf ections , and none was observed in parallel samples of 
CCE ES cells. This shows that the ProCre ES cell lines 
express sufficient Cre to recombine some extrachromosomal 
targets when the latter are present at high copy numbers. 
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Example 6 
Plant DNA Constructs 

To define sequences in the IiAT52 and LAT5 9 promoters 
involved in expression in pollen, proximal promoters were 
5 constructed employing a series of linker substitution 
mutants using the particle bombardment system (Klein et al . 
(1987) Nature 327:70-73; Twell et al . (1989b) Plant Physiol 
91:1270-1274). These experiments were performed by co- 
bombarding the test pi asmids (lucif erase [LUC] - recombinase 
lO fusions) with reference plasmids (fi-glucuronidase [GUS] 
fusions) . The latter served as a control for bombardment 
variability and allowed comparisons to be made between 
independent bombardments . 

The context of the -100 promoter in LAT52 and the -115 
15 promoter in LAT5 9 was chosen because these promoters 
appeared to be the minimal regions that still conferred 
high levels (25% relative to the available full-length 
promoter) of pollen- specif ic expression (Twell et al . 
(1991) Gen Dev 5:496-507). These minimal promoters were 
20 then fused to the Cre coding sequence operatively linked to 
the luc gene (Ow et al . (1986) Science 234 : 856-858) coding 
region, and the resulting plasmids served as a basis for 
creating the nucleic acid constructs. The IiAT52 linker 
substitutions were performed in p52LUC, which contain 
25 entire 1lAT52 5' untranslated region (5' UTR) . A series of 
six 9- to 10 -bp- long linker substitutions were made in 
p52LUC, spanning the region -84 to -29 (52LS1 to 52LS6) . 
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Example 7 
Tissue Specificity in Plants 

The results obtained by transient expression in pollen 
and in transgenic plants provided information on the effect 
5 of the various constructs on expression in pollen but not 
on their effect on tissue specificity. A tobacco cell 
culture, TXD (maintained as described by Howard et al . 
(1992) Ceil 68:109-118), was, therefore, added as an 
additional component of the transient assay system. The 

10 TXD cell culture was initiated from tobacco mesophyll cells 
and therefore represents somatic tissue, as opposed to the 
gametophytic tissue represented by pollen. Cells in 
culture were chosen, rather than intact tissue, as the 
somatic tissue source because such cells superficially 

15 resemble pollen in that they can be spread out as a 
monolayer on a plate before bombardment . 

In this experiment, translation fusions between the 
luc coding region and either the CaMV 35S promoter drove 
strong expression in cell culture but negligible expression 

20 in pollen, whereas the LAT52 promoter showed the opposite 
pattern of strong activity in pollen and negligible 
activity in cell culture. Thus, the transient assay system 
mimics the oxpression pattern observed for these promoters 
in transgenic plants (Twell et al . (1991) Genes Dev 5:496- 

25 507) . This differential expression provided us with a tool 
with which to address tissue specificity. 

Example 8 



r.omato Khycopersicon t^sculentum cv Vr36) by Agrohacteriiun 
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tuxnefaciens LBA44 04 as previously described (McCormick 
(1991b) Transformation of tomato with Agrohacterium 
tuniefaciens , In Plant Tissue Culture Manual, K, Linsey, Ed 
B6:l-9). At least 20 independent transf ormants were 
5 obtained for each construct . 



For S-glucuronidase (GUS) assays, 5 to 20 of 
pollen, pooled from several flowers of the same plant, was 
ground directly in Eppendorf tubes in 50 to 100 /xL of GUS 
extraction buffer (Jefferson et al . (1987) EMBO 6:3901- 

10 3907) using a Teflon- tipped homogenizer driven by a drill. 
Expression in pollen was measured by f luorometrically 
assaying GUS activity in supernatants of pollen extracts 
using 2mM 4 -methylumbellif eryl S-D-glucuronide (Sigma) as 
substrate (Jefferson et al.(1987) EMBO 6:3901-3907). GUS 

15 activity was corrected for variation in total protein 
content using a bicinchoninic acid protein assay kit 
(Pierce, Rockford, IL) . 



Expression in leaves, flowers, stems, roots, and seed 
was tested histochemically by staining with 5-bromo-4- 
20 chloro-3 -indolyl S-D-glucuronide (Molecular Probes, Eugene, 
OR) as described previously (Jefferson et al.(1987) EMBO 

6:3901-3907). Expression in leaves was also analyzed 
f luorometrically as given previously. 



Example 9 

25 Transient Transformation of Tobacco Pollen 

and Cell Culture 



-5 
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TXD cell culture (maintained as described by Howard et al . 
(1992) Ceil 68:109-118) was spread out similarly as a 
monolayer (1 mL of a 50-mL stationary culture per plate) 
and bombarded as previously described. Between six and 12 
5 independent bombardments were performed for each construct . 
In each experiment, the test plasmid was co-bombarded with 
a reference plasmid: pB1223 (Clontech, Palo Alto, CA) was 
used for assays of all constructs in tobacco cell culture; 
pLAT59-12 (Twell et al . (1990) Development 109:705-713) for 

10 assays of LAT52 and LAT56 constructs in tobacco pollen; 
pLAT56-12 (Twell et al . (1990) Deveiop/nent 109:705-713) for 
assays of LAT59 constructs in tobacco pollen. Processing 
of the tissue after - 15 to 17 hr and analysis of GUS and 
LUC activity were as described previously (Twell et al . 

15 (1991) Genes Dev 5:496-507) . Transient expression was 
reported as "relative LUC activity," which represents the 
ratio between the test (LUC) and the reference (GUS) 
plasmids . 

While the invention has been described in detail with 
20 reference to certain preferred embodiments thereof, it will 
be understood that modifications and variations are within 
the spirit and scope of that which is described and 
claimed . 
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That which is claimed is: 



1. A nucleic acid construct comprising a germline- 
specific promoter operatively associated with a recombinase 
coding sequence . 



2 . A nucleic acid construct according to claim 1 
wherein said germline-specif ic promoter is the protamine 1 
gene promoter, the protamine 2 gene promoter, the 
spermatid-specif ic promoter from the c-kit gene, the sperm- 
specific promoter from angiotensin-converting enzyme, 
oocyte specific promoter from the ZPl gene, oocyte specific 
promoter from the ZP2 gene, or oocyte specific promoter 
from the ZP3 gene. 



3 . A nucleic acid construct according to claim 1 
wherein said germline-specif ic promoter is the LAT52 gene 
promoter from tomato, the LATS6 gene promoter from tomato, 
the LAT59 gene promoter from tomato, the pollen-specific 
promoter of the Brassica S locus glycoprotein gene, or the 
pollen- specif ic promoter of the NTP303 gene. 



4. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes Cre 
recombinase . 



b. A nucleic acid construct according to claim 4 
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6. A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes FLP 
recombinase . 



7. A nucleic acid construct according to claim 6 
wherein said construct is ProFLP, comprising the protamine 
1 gene promoter operatively associated with FLP 
recombinase . 



8 . A nucleic acid construct according to claim 1 
wherein said recombinase coding sequence encodes the R gene 
product of Zygosaccharomyces , 



9. A nucleic acid construct according to claim 8 
wherein said construct is ProR, comprising the protamine 1 
gene promoter operatively associated with the R gene 
product of Zygosa.ccha.romyces . 



10. A nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence . 



11. A nucleic acid construct comprising a tissue- 
specific promoter operatively associated with a recombinase 
coding sequence . 



12. Embryonic steni cells containing a nucleic acid 
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active selectable marker flanked by two recombination 
target sites. 



14. Embryonic stem cells according to claim 13 
wherein the recombinase encoded by the recombinase coding 
sequence operatively associated with a germline-specif ic 
promoter is selective for the recombination target sites 
flanking said selectable marker. 



15. Embryonic stem cells according to claim 13 
further comprising one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 
said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, or 

a nucleic acid construct comprising a tissue-specific 
promoter operatively associated with a recombinase coding 
sequence . 



16. Embryonic stem cells containing a nucleic acid 
construct according to claim 2 . 



17. 

construct 



Embryonic 
according 



stem cells 
to cl aim 3 . 



containing a nucleic acid 
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19. Embryonic stem cells containing a nucleic acid 
construct according to claim 5. 



20. Embryonic stem cells containing a nucleic acid 
construct according to claim 6 . 



21. Embryonic stem cells containing a nucleic acid 
construct according to claim 7 . 



22. Embryonic stem cells containing a nucleic acid 
construct according to claim 8 . 



23 . Embryonic stem cells containing a nucleic acid 
construct according to claim 9 . 



24 . Embryonic stem cells containing a nucleic acid 
construct according to claim 10. 



25. Embryonic stem cells according to claim 24 
wherein the genome thereof comprises a transcriptionally 
active selectable marker flanked by two recombination 

target, sites . 



26. Embryonic stem cells containing a nucleic acid 
construct according to claim 11 . 



2 7 . 



Embryonic stem cells according to claim 26 
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28. A method for excission of the transcriptionally 
active selectable marker from the embryonic stem cells of 
claim 13, said method comprising: 

passaging the genome derived from said embryonic stem 
cells through gametogenesis . 



29. A method according to claim 28 wherein said 
genome is passaged through spermatogenesis. 



30. A method according to claim 28 wherein said 
genome is passaged through oogenesis. 



31. A method according to claim 28 wherein said 
embryonic stem cells further comprise one or more of: 

a nucleic acid fragment flanked by two recombination 
target sites, wherein said recombination target sites are 
different than the recombination target sites which flank 
said selectable marker, 

a nucleic acid construct comprising a conditional 
promoter operatively associated with a recombinase coding 
sequence, or 

a nucleic acid construct comprising a t issue - sped I ic 
promoter operatively associated with a recombinase codin'.j 
sequence . 



32. A method for the production of recombinant 
alleles, said method comprising: 
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passaging the genome derived from said embryonic stem 
cells through gametogenesis . 



33. A method according to claim 32 wherein said 
nucleic acid fragment comprises an essential portion of a 
gene of interest . 



34. A method according to claim 32 wherein said 
nucleic acid fragment is introduced by homologous 
recombination, random insertion, retroviral insertion, or 
site specif ic -mediated recombination . 

35. A method for the production of recombinant 
alleles, said method comprising: 

introducing a nucleic acid fragment flanked by at 
least two recombination target sites into embryonic stem 
cells according to claim 13, and 

passaging the genome derived from said embryonic stem 
cells through gametogenesis. 

36. A method according to claim 35 wherein said 
embryonic stem cells further comprise a second nucieiic acid 
cronstruct selected from the group consisting of a construct 
comprising a conditional promoter operativeiy associated 
with a recombinase coding sequence and a construct 
comprising a tissue-specific promoter operativeiy 
associated with a recombinase coding sequence. 



i. p. ret sponse t o i nduc i nq c'oi id 1 1 ions 
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38. A method according to claim 36 wherein the 
recombinase encoded by said second construct is expressed 
in a tissue selective manner. 

39. A method according to claim 35 wherein the 
recombination target sites flanking said nucleic acid 
fragment are recognized by a recombinase which is expressed 
under the control of a conditional promoter or a tissue 
specific promoter . 

40. A method for the production of recombinant 
alleles, said method comprising: 

introducing at least one recombinase responsive construct 
into embryonic stem cells according to claim 10, 

wherein said construct (s) comprise (s) a nucleic 
acid fragment and a selectable marker, 

wherein said selectable marker is flanked by a 
first pair of recombination target sites, and 

wherein said nucleic acid fragment is flanked by 
a second pair of recombination target sites, 

passaging the genome derived from said e^mbryonic stem celJ.s 
through gametogencsis . 

41. A method according to claim 40 wherein said first 
pair of recombination target sites is recognized by a 
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which is expressed under the control of a conditional 
promoter or a tissue specific promoter. 



42. A method according co claim 40 wherein said 
embryonic stem cells further comprise a second nucleic acid 
construct selected from the group consisting of a construct 
comprising a conditional promoter operatively associated 
with a recombinase coding sequence and a construct 
comprising a tissue- specif ic promoter operatively 
associated with a recombinase coding sequence. 



43 . A method for the conditional assembly of 
functional gene(s) for expression in eukaryotic cells by 
recombination of individual inactive gene segments from one 
or more gene{s) of interest, 

wherein each of said segments contains at least one 
recombination target site, and 

wherein at least one of said segments contains at 
least two recombination target sites, 



said method comprising: 

introducing said individual inactive gene? 
segments into an embryonic stem cell according to 
claim 10, thereby providing a DNA which encoder a 
functional gene of interest, the expression product of 
which is biologically active, upon passage of the 
genome derivea from said stem cells through 
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cotnbining embryonic stem cells that include a nucleic 
acid construct according to claim l with host 
pluripotential ES cells derived from early preimplantat ion 
embryos , and 

introducing these combined embryos into a host female 

and 

allowing the derived embryos to come to term. 

45. A method for the generation of recombinant 
plants, said method comprising transforming plant zygotes 
with nucleic acid constructs according to claim 1 and 
allowing the zygote to develop. 



1/2 




SEQUENCE LISTING 



<110> O'Gorman, Steve 
Wahi, Geoffrey 

<120> Site-Specific Germline Recombination in 
Eukaryotes and Constructs Useful Therefor 

<130> Salk2190 

<150> 08/919,501 
<151> 1997-08-28 

<IG0> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 652 
<212> DNA 

<213> Mus musculus 

< 4 0 0 > 1 

gtctagtaat gtccaacacc tccctcagtc caaacactgc tctgcatcca tgtggctccc 60 

atttatacct gaagcacttg atggggcctc aatgttttac tagagcccac ccccctgcaa 120 

ctctgagacc ctctggattt gtctgtcagt gcctcactgg ggcgttggat aatttcttaa 180 

aaggtcaagt tccctcagca gcattctctg agcagtctga agatgtgtgc tttcacagtt 240 

acaaatccat gtggctgttt cacccacctg cctggccttg ggttatctat caggacctag 300 

cctagaagca ggtgtgtggc acttaacacc taagctgagt gactaactga acactcaagt 360 

ggatgccatc tttgtcactt cttgactgtg acacaagcaa ctcctgatgc caaagccctg 420 

cccacccctc tcatgcccat atttggacat ggtacaggtc ctcactggcc atggtctgtg 480 

aggtcctggt cctctttgac tt.cataattc ctaggggcca ctagtatcta taagaggaag 540 

399gtgctqg ctcccaggcc acagcccaca aaattccacc tgctcacagg ttggctggct 600 

cgacccaggt ggtgtcccct gctctgagcc agctcccggc caagccagca cc 652 



< 1 0 > 3 
<:vLi> 31 
<2L2> DNA 

<2L3> Artificial Sequence 
<40Q> 3 

ctctgagcca gctcccggcc aagccagcac c 31 



<210> 2 
<211> 29 

<2i::> DNA 

<213> Artificial Sequence 



<4 00> 2 

qt. ctagt .-tat, gtccaacacc t. ccctcagt 



<210> 4 



I- J 1 \ k' ' ' ' -;ajg.J f i ' 

• .:-."aat t t .1 f cijficcgf acM c'::aaaat -i 
.jaqgr, t. c:ac*a agaacctgat ggacatgttc 
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tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtg aataaccgga 240 

aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg 300 

gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt catcgtcggt 360 

ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 420 

aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 4 80 

tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata cgtaatctgg 540 

catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 600 

ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 660 

cgctggctag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 720 

agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 780 

tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact cgcgccctgg 84 0 

aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 900 

gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 96 0 

ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta aatattgtca 1020 

tg 1022 



<?.10> 5 
<211> 2293 
<:il2> DNA 

<213> Artificial Sequence 



<400> 5 

gtctagtaat gtccaacacc tccctcagtc caaacactgc tctgcatcca tgtggctccc 60 

atttatacct gaagcacttg atggggcctc aatgttttac tagagcccac ccccctgcaa 120 

ctctgagacc ctctggattt gtctgtcagt gcctcactgg ggcgttggat aatttcttaa 180 

aaggtcaagt tccctcagca gcattctctg agcagtctga agatgtgtgc tttcacagtt 240 

acaaatccat gtggctgttt cacccacctg cctggccttg ggttatctat caggacctag 300 

cctagaagca ggtgtgtggc acttaacacc taagctgagt gactaactga acactcaagt 360 

ggatgccatc tttgtcactt cttgactgtg acacaagcaa ctcctgatgc caaagccctg 420 

cccacccctc tcatgcccat atttggacat ggtacaggtc ctcactggcc atggtctgtg 480 

aggtcctggt cctctttgac ttcataattc ctaggggcca ctagtatcta taagaggaag 540 

agggtgctgg ctcccaggcc acagcccaca aaattccacc tgctcacagg ttggctggct 6 00 

cgacccaggt ggtgtcccct gctctgagcc agctcccggc caagccagca cccgggacca 660 

tggagcaaaa gctgatttct gaggaggatc tgggaggacc caagaagaag aggaaggtgt 72 0 

ccaatttact gaccgtacac caaaatttgc ctgcattacc ggtcgatgca acgagtgatg 780 

aggttcgcaa gaacctgatg gacatgttca gggatcgcca ggcgttttct gagcatacct 84 0 

ggaaaatgct tctgtccgtt tgccggtcgt gggcggcatg gtgcaagttg aataaccgga 900 

aatggtttcc cgcagaacct gaagatgttc gcgattatct tctatatctt caggcgcgcg 960 

gtctggcagt aaaaactatc cagcaacatt tgggccagct aaacatgctt catcgtcggt 102 0 

ccgggctgcc acgaccaagt gacagcaatg ctgtttcact ggttatgcgg cggatccgaa 1080 

aagaaaacgt tgatgccggt gaacgtgcaa aacaggctct agcgttcgaa cgcactgatt 114 0 

tcgaccaggt tcgttcactc atggaaaata gcgatcgctg ccaggatata cgtaatctgg 1200 

catttctggg gattgcttat aacaccctgt tacgtatagc cgaaattgcc aggatcaggg 12(-.0 

ttaaagatat ctcacgtact gacggtggga gaatgttaat ccatattggc agaacgaaaa 132') 

cgctggttag caccgcaggt gtagagaagg cacttagcct gggggtaact aaactggtcg 13 80 

agcgatggat ttccgtctct ggtgtagctg atgatccgaa taactacctg ttttgccggg 1440 

tcagaaaaaa tggtgttgcc gcgccatctg ccaccagcca gctatcaact cgcgccctgg 150 ) 

aagggatttt tgaagcaact catcgattga tttacggcgc taaggatgac tctggtcaga 1560 

gatacctggc ctggtctgga cacagtgccc gtgtcggagc cgcgcgagat atggcccgcg 1620 

ctggagtttc aataccggag atcatgcaag ctggtggctg gaccaatgta aatattgtca 1680 

tgaactatat ccgtaacctg gatagtgaaa caggggcaat ggtgcgcctg ctggaagatg 174 0 

gcgattagcc attaacgcgt aaatgattgc tataattatt tgatatttat ggtgacatat 1800 

gagaaaggat ttcaacatcg acggaaaata tgtagtgctg tctgtaagca ctaatattca 18h0 

qtcgccagcc gacattgtca ctgtaaagct gagcgataga atgcctgata ttqactcaat 19?,0 



jaggggat c-j tj<:aa t.aaaaa r.jacagaal a a aacqra::gq<i t -it Vgggtc-q r. t * ' rqr.M 2 -i ' 

' f : q a t c c: q * ; * q <"i c 2 2'.f ' 
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<210> 6 
<?,11> 86 
<.:12> UNA 

<i:i3> Artificial Sequence 
<400> 6 

cccgggatca attcaccatg ggaataactt cgtatagcat acattatacg aagttatgga 60 

tccgccgcta tcaggacata gcgttg 86 

c:2io> 7 

<211> 4172 
<2ir.> DNA 

<213> Artificial Sequence 
<400> 7 

gcacttttcg gggaaatgtg cgcggaaccc ctatttgttt atttttctaa atacattcaa 60 

atatgtatcc gctcatgaga caataaccct gataaatgct tcaataatat tgaaaaagga 120 

agagtatgag tattcaacat ttccgtgtcg cccttattcc cttttttgcg gcattttgcc 180 

ttcctgtttt tgctcaccca gaaacgctgg tgaaagtaaa agatgctgaa gatcagttgg 240 

gtgcacgagt gggttacatc gaactggatc tcaacagcgg taagatcctt gagagttttc 3 00 

gccccgaaga acgttttcca atgatgagca cttttaaagt tctgctatgt ggcgcggtat 360 

tatcccgtat tgacgccggg caagagcaac tcggtcgccg catacactat tctcagaatg 420 

acttggttga gtactcacca gtcacagaaa agcatcttac ggatggcatg acagtaagag 4 80 

aattatgcag tgctgccata accatgagtg ataacactgc ggccaactta cttctgacaa 54 0 

cgatcggagg accgaaggag ctaaccgctt ttttgcacaa catgggggat catgtaactc 600 

gccttgatcg ttgggaaccg gagctgaatg aagccatacc aaacgacgag cgtgacacca 66 0 

cgatgcctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa ctacttactc 720 

tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca ggaccacttc 78 0 

tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc ggtgagcgtg 84 0 

ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt atcgtagtta 900 

tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc gctgagatag 96 0 

gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat atactttaga 1020 

ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt tttgataatc 1080 

tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac cccgtagaaa 1140 

agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa 1200 

aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca actctttttc 1260 

cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta gtgtagccgt 1320 

agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct ctgctaatcc 1380 

tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg gactcaagac 1440 

gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc acacagccca 1500 

gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcta tgagaaagcg 1560 

ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg gtcggaacag 162 0 

gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt cctgtcgggt 16R0 

ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg cggagcctat 1740 

ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg ccttttgctc 1800 

acatgttctc tcctgcgtta tcccctgatt ctgtggataa ccgtattacc gcctttgagt 18C0 

gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg agcgaggaag 1920 

cggaagagcg cccaatacgc aaaccgcctc tccccgcgcg ttggccgatt cattaatgca 1980 

gctggcacga caggtttccc gactggaaag cgggcagtga gcgcaacgca attaatgtga r^040 

gttagctcac tcattaggca ccccaggctt tacactttat gcttccggct cgtatgttgt ::ino 

gtggaattgt gagcggataa caatttcaca caggaaacag ctatgaccat gattacgcca 2160 

agctcgaaat Laaccctcac taaagggaac aaaagctggg tacgaattca gatctcccgg 221>0 

gatcaattca ccatgggaat aacttcgtat agcatacatt atacgaagtt atggatccgg 2280 

tcgagcagtg tggttttgca agagqaaqca aaaaqcctct: ccacccaggc ctqqaatqtt :M4 0 



4a cacaa "a .1 acaat,cggct acr.ctqrirqc rgcrgtgt t cggctgtcaq c<^c:aggggc:'j 

:"'r:cggt t ct tM.gtcaaga ccgacctgt c: cggtgccctg aatigaactgc aggacgaggc 27»S 
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cactgaagcg ggaagggact ggctgctatt gggcgaagtg ccggggcagg atctcctgtc 2880 

atctcacctt gctcctgccg agaaagtatc catcatggct gatgcaatgc ggcggctgca 2940 

tacgcttgat ccggctacct gcccattcga ccaccaagcg aaacatcgca tcgagcgagc 3000 

acgtactcgg atggaagccg gtcttgtcga tcaggatgat ctggacgaag agcatcaggg 3060 

gctcgcgcca gccgaactgt tcgccaggct caaggcgcgc atgcccgacg gcgaggatct 3120 

cgtcgtgacc catggcgatg cctgcttgcc gaatatcatg gtggaaaatg gccgcttttc 3180 

tggattcatc gactgtggcc ggctgggtgt ggcggaccgc tatcaggaca tagcgttggc 324 0 

tacccgtgat attgctgaag agcttggcgg cgaatgggct gaccgcttcc tcgtgcttta 3 300 

cggtatcgcc gctcccgatt cgcagcgcat cgccttctat cgccttcttg acgagttctt 3360 

ctgaggggat cggcaataaa aagacagaat aaaacgcacg ggtgttgggt cgtttgttcg 3420 

gatagggatc aattcaccat gggaataact tcgtatagca tacattatac gaagttatgg 34 80 

atccactagt tctagagcgg ccgccaccgc ggtggagctc caattcgccc tatagtgagt 3 54 0 

cgtattacaa ttcactggcc gtcgttttac aacgtcgtga ctgggaaaac cctggcgtta 3 6 00 

cccaacttaa tcgccttgca gcacatcccc ctttcgccag ctggcgtaat agcgaagagg 3660 

cccgcaccga tcgcccttcc caacagttgc gcagcctgaa tggcgaatgg gacgcgccct 3720 

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg 3780 

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg 3840 

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac 3900 

ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg ccatcgccct 3960 

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt 4020 

tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta taagggattt 4080 

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt 414 0 

ttaacaaaat attaacgctt acaatttagg tg 4172 

<210> 8 
<211> 34 
<212> DNA 

<213> Artificial Sequence 



<400> 8 

ataacttcgt atagcataca ttatacgaag ttat 
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